PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.4686s0015.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 418aa    MW: 47919.3 Da    PI: 6.9977
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.4686s0015.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix70.82.5e-22111213186
             trihelix   1 rWtkqevlaLiearremeerlrrgk................lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.ts 73 
                          +Wt+ +v++Li a+ +++++ + +                 +kk++W++vs++m e+gf++sp+qC++k+++lnkryk+++++ +k+ ++
  Cagra.4686s0015.1.p 111 KWTDTMVRLLIMAVFYIGDEAGLNDpidvkkksggggggmlQKKGKWKSVSRAMVEKGFSVSPQQCEDKFNDLNKRYKRVNDILGKGiAC 200
                          7**************88888886544555667788888899*********************************************9889 PP

             trihelix  74 essstcpyfdqle 86 
                          +++++  +++ ++
  Cagra.4686s0015.1.p 201 RVVENQGLLEGMD 213
                          9999999988887 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138373.4E-20109239No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0010629Biological Processnegative regulation of gene expression
GO:1900037Biological Processregulation of cellular response to hypoxia
GO:0005634Cellular Componentnucleus
Sequence ? help Back to Top
Protein Sequence    Length: 418 aa     Download sequence    Send to blast
MESNVMFSGF SPRMLSLEMP QNPQNPQNSV QFQHPHPYTS TSADQQTQPP MIKPLYPYAS  60
APPSKPKQLS PMSGCNGDDE DRGSGSGSGC NPDDSAGTDG KRKLSQWHRM KWTDTMVRLL  120
IMAVFYIGDE AGLNDPIDVK KKSGGGGGGM LQKKGKWKSV SRAMVEKGFS VSPQQCEDKF  180
NDLNKRYKRV NDILGKGIAC RVVENQGLLE GMDHLTPKLK DEVKKLLNSK HLFFREMCAY  240
HNSCGHLGGH DQPPQQSPVT IPVQQNCFHA AEAGKMARIA EREEAEEDVE SDMAEDSESE  300
MEESEEEEEE ETTKKKRRVL TTAVKRLREE AARVVEDVGK SVWEKKEWMR RKMLEIEEKK  360
IGYEWEGVEM EKQRVKWMRY RSKKEREMEK AKLDNQRRRL ETERMVLMLR RSEIELNE
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1313317KKKRR
2313318KKKRRV
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankATAC0109270.0AC010927.5 Arabidopsis thaliana chromosome III BAC T22K18 genomic sequence, complete sequence.
GenBankAY0910280.0AY091028.1 Arabidopsis thaliana unknown protein (At3g10040) mRNA, complete cds.
GenBankCP0026860.0CP002686.1 Arabidopsis thaliana chromosome 3, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006299896.10.0hypothetical protein CARUB_v10016104mg
TrEMBLR0HSJ60.0R0HSJ6_9BRAS; Uncharacterized protein
STRINGAT3G10040.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM67622741
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G10040.10.0sequence-specific DNA binding transcription factors